The Troubling Aspects of a Building Block Hypothesis for Genetic Programming

نویسندگان

  • Una-May O'Reilly
  • Franz Oppacher
چکیده

In this paper we rigorously formulate the Schema Theorem for Genetic Programming (GP). This involves defining a schema, schema order, and defining length and accounting for the variable length and the non-homologous nature of GP's representation. The GP Schema Theorem and the related notion of a GP Building Block are used to constlUct a testable hypothetical account of how GP' searches by hierarchically combining building blocks. Since building blocks need to have consistent above average fitness and compactness, and since the term in the GP Schema Theorem that expresses compactness is a random variable, the proposed account of GP search behavior is based on empirically questionable statistical assumptions. In particular, low Valiance in schema fitness is questionable because the performance of a schema depends in a highly sensitive manner on the context provided by the prograInS in which it is found. GP crossover is likely to change this context from one generation to the next which results in high variance in observed schema fitness. Low variance in compactness seems fortuitous rather than assured in GP because schema-containing prograInS change their sizes essentially at random. Introduction Based upon the Schema Theorem [Holland 1975], the Building Block Hypothesis [Holland'92, Go1dberg'89] is the original account of the search power in Genetic AlgOlithms (GAs). Holland's work is fundaInenta1 analysis, and from it more precise or clarified explanations (some diverging from a schema-based approach) of GA search behavior have been pursued. Some experimental [Mitchell, FOlTest et al. 1991; FOtTest and Mitchell 1992] and theoretical [Grefenstette and Baker 1989; Radcliffe 1991; Altenberg 1994] research has even diminished the value of the Building Block Hypothesis as a description of how GAs search or the source of the GA's power. For eXaInple, it is unclear which classes of search functions (even those with building block sllucture) the GA will solve faster than other seal'ch techniques such as hill clinIbing [Mitchell, Holland et al. 1993]. The Schema Theorem and Building Block Hypothesis were but the start of the process of detailed analysis of GAs. Genetic Programming (GP) [Koza 1992] is "a (relatively) new kid on the evolution-based algorithms block". The enthusiasm to apply GP has outpaced the attention paid to explaining it as a seal'Ch technique. Koza [Koza 1992] presents but a brief sketchy analogy with the GA Schema Theorem and Building Block Hypothesis. Another approach based upon population genetics analysis [Altenberg 1994] has great potential however the assumptions of the simplified "generation 0" model raise the SaIne unresolved clUcia1 issues we shall present here. In this paper we go through the exercise of rigorously formulating the Schema Theorem for GP. To do so involves defining a schema, schema order, and defining length and accounting

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchically Consistent Test Problems for Genetic Algorithms

The Building-Block Hypothesis suggests that the genetic algorithm (GA) will perform well when it is able to identify above-average-fitness low-order schemata and recombine them to produce higher-order schemata of higher fitness. We suppose that the recombinative process continues recursively, combining schemata of successively higher orders as search progresses. Historically, attempts to illust...

متن کامل

Application of Genetic Programming to Modeling and Prediction of Activity Coefficient Ratio of Electrolytes in Aqueous Electrolyte Solution Containing Amino Acids

Genetic programming (GP) is one of the computer algorithms in the family of evolutionary-computational methods, which have been shown to provide reliable solutions to complex optimization problems. The genetic programming under discussion in this work relies on tree-like building blocks, and thus supports process modeling with varying structure. In this paper the systems containing amino ac...

متن کامل

Bedload transport predictions based on field measurement data by combination of artificial neural network and genetic programming

Bedload transport is an essential component of river dynamics and estimation of its rate is important to many aspects of river management. In this study, measured bedload by Helley- Smith sampler was used to estimate the bedload transport of Kurau River in Malaysia. An artificial neural network, genetic programming and a combination of genetic programming and a neural network were used to estim...

متن کامل

Bedload transport predictions based on field measurement data by combination of artificial neural network and genetic programming

Bedload transport is an essential component of river dynamics and estimation of its rate is important to many aspects of river management. In this study, measured bedload by Helley- Smith sampler was used to estimate the bedload transport of Kurau River in Malaysia. An artificial neural network, genetic programming and a combination of genetic programming and a neural network were used to estim...

متن کامل

Mathematical Programming Models for Solving Unequal-Sized Facilities Layout Problems - a Generic Search Method

 This paper present unequal-sized facilities layout solutions generated by a genetic search program named LADEGA (Layout Design using a Genetic Algorithm). The generalized quadratic assignment problem requiring pre-determined distance and material flow matrices as the input data and the continuous plane model employing a dynamic distance measure and a material flow matrix are discussed. Computa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994